A Corrective Learning Approach for Text-independent Speaker Verification

Authors

  • Yandong Wen
  • Tianyan Zhou
  • Rita Singh
  • Bhiksha Raj
Abstract

We present a conceptually plausible approach for text-independent speaker verification (TISV) that treats a speech recording as a collection of segments, each providing incremental evidence. This approach, called corrective learning, gradually improves an initial prediction of speaker identity using incoming speech and the latest prediction. Specifically, we propose deep corrective learning networks (CLNets) that explicitly learn a mapping from a new speech segment and the current prediction to a correction. Intuitively, the predictions eventually converge to the ground truth after several corrections. Trained on NIST SRE datasets, CLNets outperform current CNN and i-vector baselines. Moreover, CLNets and i-vectors are complementary, and their fusion yields significant performance improvements over what either achieves individually.
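
The corrective loop described in the abstract can be illustrated with a minimal sketch. It is not the authors' CLNet: `toy_correction` is a hypothetical stand-in for the trained network (it simply moves the prediction a fixed fraction toward the evidence in each segment), and the three-speaker score vectors, the `step` parameter, and all function names are assumptions made for illustration only.

```python
from typing import Callable, List, Sequence

Vector = List[float]


def toy_correction(segment: Sequence[float],
                   prediction: Sequence[float],
                   step: float = 0.5) -> Vector:
    """Hypothetical stand-in for a trained CLNet.

    Maps (new segment, current prediction) to a correction. Here the
    correction is just a fixed fraction of the gap between the segment's
    evidence and the current prediction; a real CLNet would learn this map.
    """
    return [step * (s - p) for s, p in zip(segment, prediction)]


def corrective_predict(segments: Sequence[Sequence[float]],
                       initial: Sequence[float],
                       correct: Callable[..., Vector] = toy_correction
                       ) -> List[Vector]:
    """Apply one correction per incoming segment and record every prediction."""
    prediction = list(initial)
    history = [prediction]
    for segment in segments:
        delta = correct(segment, prediction)          # correction from segment + latest prediction
        prediction = [p + d for p, d in zip(prediction, delta)]
        history.append(prediction)
    return history


# Toy data (assumed): three candidate speakers, every segment points to speaker 1.
segments = [[0.0, 1.0, 0.0]] * 4
initial = [1 / 3, 1 / 3, 1 / 3]

history = corrective_predict(segments, initial)
final = history[-1]
predicted_speaker = max(range(len(final)), key=final.__getitem__)
```

In this toy setting each segment halves the remaining gap to the evidence, so the score for speaker 1 increases with every correction, mirroring the abstract's intuition that predictions approach the ground truth after several corrections.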

Similar resources

Improving Robustness of Speaker Verification by Fusion of Prompted Text-Dependent and Text-Independent Operation Modalities

In this paper we present a fusion methodology for combining prompted text-dependent and text-independent speaker verification operation modalities. The fusion is performed at the score level, on scores extracted from single-mode GMM-UBM speaker verification engines, using several machine learning algorithms for classification. In order to improve the performance we apply clustering of the score-based data befo...

A text-independent speaker verification system using support vector machines classifier

In recent years, the technology for speaker verification or call authentication has received an increasing amount of attention in the IVR industry. However, due to the complexity of the speaker information embedded in speech signals, the current technology still cannot deliver the verification accuracy required by some applications. In this paper we introduce a new pattern classif...

Deep Speaker Vectors for Semi Text-independent Speaker Verification

Recent research shows that deep neural networks (DNNs) can be used to extract deep speaker vectors (d-vectors) that preserve speaker characteristics and can be used in speaker verification. This new method has been tested on text-dependent speaker verification tasks, and improvement was reported when combined with the conventional i-vector method. This paper extends the d-vector approach to sem...

Multi-task learning for text-dependent speaker verification

Text-dependent speaker verification uses short utterances and verifies both speaker identity and text contents. Due to this nature, traditional state-of-the-art speaker verification approaches, such as i-vector, may not work well. Recently, there has been interest in applying deep learning to speaker verification; however, in previous work, standalone deep learning systems have not achieved sta...

Bayesian Approach to Text-independent Speaker Verification

In this paper, we propose a novel approach to speaker verification. One of the problems in conventional speaker verification techniques based on the likelihood ratio test (LRT) is that the detection performance varies widely for each hypothesized speaker when the decision threshold is held fixed. In order to cope with the problem, we incorporate the distribution of the log likelihood ratio (LLR)...

Publication date: 2018